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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Clayman, Gary L. 

(ii) TITLE OF INVENTION: Methods and Compositions for the 
Diagnosis and Treatment of Cancer 

(iii) NXMBER OF SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Arnold, White and Durkee 

(B) STREET: P.O. Box 4433 

(C) CITY: Houston 

(D) STATE: TX 

(E) COUNTRY: USA 

(F) ZIP: 77210-4433 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: UNKNOWN 

(B) FILING DATE: CONCURRENTLY HEREWITH 

(C) CLASSIFICATION: UNKNOWN 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Highlander, Steven L. 

(B) REGISTRATION NUMBER: 37,642 

(C) REFERENCE/DOCKET NUMBER: INGN:022 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (512) 418-3000 

(B) TELEFAX: (512) 474-7577 


(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2066 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
CAAAACCTAC CAGGGCAGCT ACGGTTTCCG TCTGGGCTTC TTGCATTCTG GGACAGCCAA 
GTCTGTGACT TGCACGTACT CCCCTGCCCT CAACAAGATG TTTTGCCAAC TGGCCAAGAC 
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CTGCCCTGTG CAGCTGTGGG TTGATTCCAC ACCCCCGCCC GGCACCCGCG TCCGCGCCAT 180 

GGCCATCTAC AAGCAGTCAC AGCACATGAC GGAGGTTGTG AGGCGCTGCC CCCACCATGA 240 

GCGCTGCTCA GATAGCGATG GTCTGGCCCC TCCTCAGCAT CTTATCCGAG TGGAAGGAAA 3 00 

TTTGCGTGTG GAGTATTTGG ATGACAGAAA CACTTTTCGA CATAGTGTGG TGGTGCCCTA 360 

TGAGCCGCCT GAGGTTGGCT CTGACTGTAC CACCATCCAC TACAACTACA TGTGTAACAG 420 

TTCCTGCATG GGCGGCATGA ACCGGAGGCC CATCCTCACC ATCATCACAC TGGAAGACTC 480 

CAGTGGTAAT CTACTGGGAC GGAACAGCTT TGAGGTGCGT GTTTGTGCCT GTCCTGGGAG 540 

AGACCGGCGC ACAGAGGAAG AGAATCTCCG CAAGAAAGGG GAGCCTCACC ACGAGCTGCC 600 

CCCAGGGAGC ACTAAGCGAG CACTGCCCAA CAACACCAGC TCCTCTCCCC AGCCAAAGAA 660 

GAAACCACTG GATGGAGAAT ATTTCACCCT TCAGATCCGT GGGCGTGAGC GCTTCGAGAT 720 

GTTCCGAGAG CTGAATGAGG CCTTGGAACT CAAGGATGCC CAGGCTGGGA AGGAGCCAGG 780 

P 

E9 GGGGAGCAGG GCTCACTCCA GCCACCTGAA GTCCAAAAAG GGTCAGTCTA CCTCCCGCCA 840 

U1 TAAAAAACTC ATGTTCAAGA CAGAAGGGCC TGACTCAGAC TGACATTCTC CACTTCTTGT 900 

CO 

p TCCCCACTGA CAGCCTCCCA CCCCCATCTC TCCCTCCCCT GCGATTTTGG GTTTTGGGTC 960 

TTTGAACCCT TGCTTGCAAT AGGTGTGCGT CAGAAGCACC CAGGACTTCC ATTTGCTTTG 1020 

TCCCGGGGCT CCACTGAACA AGTTGGCCTG CACTGGTGTT TTGTTGTGGG GAGGAGGATG 1080 

Yii GGGAGTAGGA CATACCAGCT TAGATTTTAA GGTTTTTACT GTGAGGGATG TTTGGGAGAT 1140 

1 GTAAGAAATG TTCTTGCAGT TAAGGGTTAG TTTACAATCA GCCACATTCT AGGTAGGGGC 1200 

CCACTTCACC GTACTAACCA GGGAAGCTGT CCCTCACTGT TGAATTTTCT CTAACTTCAA 1260 

GGCCCATATC TGTGAAATGC TGGCATTTGC ACCTACCTCA CAGAGTGCAT TGTGAGGGTT 1320 

AATGAAATAA TGTACATCTG GCCTTGAAAC CACCTTTTAT TACATGGGGT CTAGAACTTG 1380 

ACCCCCTTGA GGGTGCTTGT TCCCTCTCCC TGTTGGTCGG TGGGTTGGTA GTTTCTACAG 1440 

TTGGGCAGCT GGTTAGGTAG AGGGAGTTGT CAAGTCTCTG CTGGCCCAGC CAAACCCTGT 1500 

CTGACAACCT CTTGGTGAAC CTTAGATCCT AAAAGGAAAT GTCACCCCAT CCCACACCCT 1560 

GGAGGATTTC ATCTCTTGTA TAGATGATCT GGATCCACCA AGACTTGTTT TAGCTCAGGG 1620 

TCCAATTTCT TTTTTCTTTT tTTTTTTTTT TTTCTTTTTC TTTGAGACTG GGTCTCTTTG 1680 

TTGCCCCAGG CTGGAGTGGA GTGGCGTGAT CTGGCTTACT GCAGCCTTTG CCTCCCCGGC 1740 
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TCGAGCAGTC CTGCCTCAGC CTCCGGAGTA GCTGGGACCA CAGGTTCATG CCACCATGGC 1800 

CAGCCAACTT TTGCATGTTT TGTAGAGATG GGGTCTCACA GTGTTGCCCA GGCTGGTCTC 1860 

AAACTCCTGG GCTCAGGCGA TCCACCTGTC TCAGCCTCCC AGAGTGCTGG GATTACAATT 1920 

GTGAGCCACC ACGTCCAGCT GGAAGGGTCA ACATCTTTTA CATTCTGCAA GCACATCTGC 1980 

ATTTTCACCC CACCCTTCCC CTCTTCTCCC TTTTTATATC CCATTTTTAT ATCGATCTCT 2040 

TATTTTACAA TAAAACTTTG CTGCCA 2066 


(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Lys Thr Tyr Gin Gly Ser Tyr Gly Phe Arg Leu Gly Phe Leu His Ser 
15 10 15 

Gly Thr Ala Lys Ser Val Thr Cys Thr Tyr Ser Pro Ala Leu Asn Lys 
20 25 30 

Met Phe Cys Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Asp 
35 40 45 

Ser Thr Pro Pro Pro Gly Thr Arg Val Arg Ala Met Ala lie Tyr Lys 
50 55 60 

Gin Ser Gin His Met Thr Glu Val Val Arg Arg Cys Pro His His Glu 
65 70 75 80 

Arg Cys Ser Asp Ser Asp Gly Leu Ala Pro Pro Gin His Leu lie Arg 
85 90 95 

Val Glu Gly Asn Leu Arg Val Glu Tyr Leu Asp Asp Arg Asn Thr Phe 
100 105 110 


Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Val Gly Ser Asp 
115 120 125 

Cys Thr Thr lie His Tyr Asn Tyr Met Cys Asn Ser Ser Cys Met Gly 
130 135 140 

Gly Met Asn Arg Arg Pro lie Leu Thr lie lie Thr Leu Glu Asp Ser 
145 150 ' 155 160 

Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala 


A: 85704{1%4_01!.DOQ 


165 170 175 

Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn Leu Arg Lys Lys 
180 185 190 

Gly Glu Pro His His Glu Leu Pro Pro Gly Ser Thr Lys Arg Ala Leu 
195 200 205 

Pro Asn Asn Thr Ser Ser Ser Pro Gin Pro Lys Lys Lys Pro Leu Asp 
210 215 220 

Gly Glu Tyr Phe Thr Leu Gin lie Arg Gly Arg Glu Arg Phe Glu Met 
225 230 235 240 

Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala Gin Ala Gly 
245 250 255 

Lys Glu Pro Gly Gly Ser Arg Ala His Ser Ser His Leu Lys Ser Lys 
260 265 270 

Lys Gly Gin Ser Thr Ser Arg His Lys Lys Leu Met Phe Lys Thr Glu 
P 275 280 285 

•Hi Gly Pro Asp Ser Asp 

tn 290 

C3 
P 

(2) INFORMATION FOR SEQ ID NO: 3: 

w 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 2066 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 


la 


ry 


m 


(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

CAAAACTTAC CAGGGCAACT ATGGCTTCCA CCTGGGCTTC CTGCAGTCTG GGACAGCCAA 60 

GTCTGTTATG TGCACGTACT CTCCTCCCCT CAATAAGCTA TTCTGCCAGC TGGCGAAGAC 120 

GTGCCCTGTG CAGTTGTGGG TCAGCGCCAC ACCTCCAGCT GGGAGCCGTG TCCGCGCCAT 180 

GGCCATCCAC AAGAAGTCAC AGCACTTGAC GGGGGTCGTG AGACGCTGCC CCCACCATGA 240 

GCGCTGCTCC GATGGTGATG GCCTGGCTCC TCCCCAGCAT CTTATCCGGG TGGAAGGAAA 300 

TTTGTATCCC GAGTATCTGG AAGACAGGCA GACTTTTCGC CACAGCGTGG TGGTACCTTA 360 

TGAGCCACCC GAGGCCGGCT CTGAGTATAC CACCATCCAC TACAAGTACA TTTGTAATAG 420 

CTCCTGCATG GGGGGCATGA ACCGCCGACC TATCCTTACC ATCATCACAC TGGAAGACTC 480 

CAGTGGGAAC CTTCTGGGAC GGGACAGCTT TGAGGTTCGT GTTTGTGCCT GCCCTGGGAG 540 


A: 85704(1%4_01!.DOC) 



AGACCGCCGT ACAGAAGAAG AAAATTTCCG CAAAAAGGAA GTCCTTTGCC CTGAACTGCC 600 

CCCAGGGAGC GCAAAGAGAG CGCTGCCCAC CTGCACAAGC GCCTCTCCCC CGCAAAAGAA 660 

AAAACCACTT GATGGAGAGT ATTTCACCCT CAAGATCCGC GGGCGTAAAC GCTTCGAGAT 720 

GTTCCGGGAG CTGAATGAGG CCTTAGAGTT AAAGGATGCC CATGCTACAG AGGAGTCTGG 780 

AGACAGCAGG GCTCACTCCA GCTACCTGAA GACCAAGAAG GGCCAGTCTA CTTCCCGCCA 840 

TAAAAAAACA ATGGTCAAGA AAGTGGGGCC TGACTCAGAC TGACATTCTC CACTTCTTGT 900 

TCCCCACTGA CAGCCTCCCA CCCCCATCTC TCCCTCCCCT GCCTTTTGGG TTTTGGGTCT 960 

TTGAACCCTT GCTTGCAATA GGTGTGCGTC AGAAGCACCC AGGACTTCCA TTTGCTTTGT 1020 

CCCGGGGCTC CACTGAACAA GTTGGCCTGC ACTGGTGTTT TGTTGTGGGG AGGAGGATGG 1080 

GGAGTAGGAC ATACCAGCTT AGATTTTAAG GTTTTTACTG TGAGGGATGT TTGGGAGATG 1140 

P TAAGAAATGT TCTTGCAGTT AAGGGTTAGT TTACAATCAG CCACATTCTA GGTAGGGGCC 1200 

^•4 CACTTCACCG TACTAACCAG GGAAGCTGTC CCTCACTGTT GAATTTTCTC TAACTTCAAG 1260 

m 

EQ GCCCATATCT GTGAAATGCT GGCATTTGCA CCTACCTCAC AGAGTGCATT GTGAGGGTTA 132 0 

P 

|y ATGAAATAAT GTACATCTGG CCTTGAAACC ACCTTTTATT ACATGGGGTC TAGATGACCC 1380 

IS CCTTGAGGTG CTTGTTCCCT CTCCCTGTTG GTCGGTGGGT TGGTAGTTTC TACAGTTGGG 1440 

M 

^1, CAGCTGGTTA GGTTGAGGTA GTTGTCAGGT CTCTGCTGGC CCAGCGAAAT TCTATCCAGC 1500 

I CAGTTGTTGG ACCCTGGCAC CTCAAATGAA ATCTCACCCT ACCCCACACC CTGTAAGATT 1560 

'5 

; 1 

CTATCTCTTG TATAGATGAT CTGGATCCAC CAAGACTTGT TTTAGCTCAG GGTCCAATTT 1620 

CTTTTTTCTT tTTTTTTTTT TTTTTCTTTT TCTTTGAGAC TGGGTCTCTT TGTTGCCCCA 1680 

GGCTGGAGTG GAGTGGCGTG ATCTGGCTTA CTGCAGCCTT TGCCTCCCCG GCTCGAGCAG 1740 

TCCTGCCTCA GCCTCCGGAG TAGCTGGGAC CACAGGTTCA TGCCACCATG GCCAGCCAAC 1800 

TTTTGCATGT TTTGTAGAGA TGGGGTCTCA CAGTGTTGCC CAGGCTGGTC TCAAACTCCT 1860 

GGGCTCAGGC GATCCACCTG TCTCAGCCTC CCAGAGTGCT GGGATTACAA TTGTGAGCCA 1920 

CCACGTCCAG CTGGAAGGGC CTACTTTCCT TCCATTCTGC AAAGCCCTGC TGCATTTATC 1980 

CACCCCACCC TCCACCTGTC TCCCTCTTTT TTTCTTACCC CTTTTTATAT ATCAATTTCT 2040 

TATTTTACAA TAAAATTTTG TTATCA 2066 


m 
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(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Lys Thr Tyr Gin Gly Asn Tyr Gly Phe His Leu Gly Phe Leu Gin Ser 
15 10 15 

Gly Thr Ala Lys Ser Val Met Cys Thr Tyr Ser Pro Pro Leu Asn Lys 
20 25 30 

Leu Phe Cys Gin Leu Ala Lys Thr Cys Pro Val Gin Leu Trp Val Ser 
35 40 45 

Ala Thr Pro Pro Ala Gly Ser Arg Val Arg Ala Met Ala lie His Lys 
50 55 60 

P 

|y Lys Ser Gin His Met Thr Gly Val Val Arg Arg Cys Pro His His Glu 

'^i 65 70 75 80 

in 

CO Arg Cys Ser Asp Gly Asp Gly Leu Ala Pro Pro Gin His Leu He Arg 

P 85 90 95 

iy Val Glu Gly Asn Leu Tyr Pro Glu Tyr Leu Glu Asp Arg Gin Thr Phe 

I 100 105 110 

ru 


m 


Arg His Ser Val Val Val Pro Tyr Glu Pro Pro Glu Ala Gly Ser Glu 
115 120 125 

Tyr Thr Thr lie His Tyr Lys Tyr lie Cys Asn Ser Ser Cys Met Gly 
130 135 140 

Gly Met Asn Arg Arg Pro lie Leu Thr lie lie Thr Leu Glu Asp Ser 
145 150 155 160 

Ser Gly Asn Leu Leu Gly Arg Asn Ser Phe Glu Val Arg Val Cys Ala 
165 170 175 

Cys Pro Gly Arg Asp Arg Arg Thr Glu Glu Glu Asn Phe Arg Lys Lys 
180 185 190 

Glu Val Leu Cys Pro Glu Leu Pro Pro Gly Ser Ala Lys Arg Ala Leu 
195 200 205 

Pro Thr Cys Thr Ser Ala Ser Pro Pro Gin Lys Lys Lys Pro Leu Asp 
210 215 220 

Gly Glu Tyr Phe Thr Leu Lys lie Arg Gly Arg Leu Arg Phe Glu Met 
225 230 235 240 
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Phe Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala His Ala Thr 
245 250 255 


Glu Glu Ser Gly Asp Ser Arg Ala His Ser Ser Tyr Leu Lys Ser Lys 
260 265 270 

Lys Gly Gin Ser Thr Ser Arg His Lys Lys Thr Met Val Lys Lys Val 
275 280 285 

Gly Pro Asp Ser Asp 
290 


(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ACTGCCCAAC AACACCA 


(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
GCCACGCCCA CACATTT 


(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GCCTGTCCTG GGAGAGACCG 


(2) INFORMATION FOR SEQ ID NO: 8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CCCTTAAGCC ACGCCCACAC 


(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CACTGCCCAA CAACACCA 


(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
GCCACGCCCA CACATTT 


(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GGTGCATTGG AACGCGGATT 


(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS:- 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
GGGGACAGAA CGTTGTTTTC 


(2) INFORIVIATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
ACGGATTTGG TCGTATTGGG 


(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 
TGATTTTGGA GGGATCTCGC 
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